LPC, LPCC and MFCC parameterisation applied to the detection of voice impairments

Authors

  • Juan Ignacio Godino-Llorente
  • Santiago Aguilera-Navarro
  • Pedro Gómez Vilda
Abstract

The modern way of life increases the risk of vocal and voice diseases. It is well known that most of these diseases cause changes in the acoustic voice signal, and they have to be diagnosed and treated at an early stage. Acoustic analysis is a non-invasive technique based on digital processing of the speech signal. It can be a useful tool for diagnosing this kind of disease, and it offers several advantages: it is non-invasive, it provides an objective diagnosis, and it can be used to evaluate surgical and pharmacological treatments and rehabilitation processes. ENT clinicians use acoustic voice analysis to characterise pathological voices. In this paper, we study three well-known parameterisation approaches applied to the automatic detection of voice disorders. Previous and current work demonstrates that impaired voice detection can be carried out by means of supervised neural networks such as the multilayer perceptron (MLP). We focus on detecting impaired voices by means of artificial neural networks (ANN) and parameters such as LPC, LPCC and MFCC extracted from the voice signal. The performance of the neural-network-based detector is compared with that obtained using acoustic parameters such as F0, NHR, NNE, shimmer and jitter, among others, as input variables. The aim of this paper is to study and compare these parameterisation methods, widely used in speech technology, as applied to the detection of impaired voices.
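The abstract does not describe how the LPC and LPCC parameters are extracted, so the following is only a minimal sketch of the textbook route: the autocorrelation method with the Levinson-Durbin recursion for LPC, followed by the standard LPC-to-cepstrum recursion for LPCC. The function names, the predictor sign convention (x[n] ≈ Σ a_k x[n-k]) and the omission of the gain term c_0 are assumptions made for illustration, not the authors' implementation.

```python
def autocorrelation(frame, order):
    """Autocorrelation lags r[0..order] of a single frame of samples."""
    n = len(frame)
    return [sum(frame[i] * frame[i + k] for i in range(n - k))
            for k in range(order + 1)]


def levinson_durbin(r, order):
    """Solve the LPC normal equations; returns ([a_1..a_p], residual energy).

    Assumed convention: x[n] is predicted as sum_k a_k * x[n - k].
    """
    if r[0] <= 0.0:
        raise ValueError("frame has zero energy; LPC is undefined")
    a = [0.0] * (order + 1)  # a[0] is unused
    err = r[0]
    for i in range(1, order + 1):
        # Reflection coefficient for model order i.
        k = (r[i] - sum(a[j] * r[i - j] for j in range(1, i))) / err
        updated = a[:]
        updated[i] = k
        for j in range(1, i):
            updated[j] = a[j] - k * a[i - j]
        a = updated
        err *= 1.0 - k * k
    return a[1:], err


def lpc_to_lpcc(a, n_ceps):
    """Cepstral coefficients c_1..c_n_ceps of the all-pole model 1/(1 - sum a_k z^-k)."""
    p = len(a)
    c = [0.0] * (n_ceps + 1)  # c[0] (gain term) is left out of the result
    for n in range(1, n_ceps + 1):
        acc = a[n - 1] if n <= p else 0.0
        for k in range(max(1, n - p), n):
            acc += (k / n) * c[k] * a[n - k - 1]
        c[n] = acc
    return c[1:]


if __name__ == "__main__":
    # Synthetic frame: impulse response of a one-pole filter with pole at 0.5.
    frame = [0.5 ** n for n in range(200)]
    lpc, _ = levinson_durbin(autocorrelation(frame, 2), 2)
    print(lpc)
    print(lpc_to_lpcc(lpc, 4))
```

In a detector of the kind the abstract describes, vectors like these would be computed per windowed frame and passed to the classifier as input features. The frame length, window type, model order and number of cepstral coefficients are not given here and would have to be chosen separately.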


Similar resources

Automatic Speaker Recognition using LPCC and MFCC

A person's voice contains various parameters that convey information such as emotion, gender, attitude, health and identity. This report talks about speaker recognition which deals with the subject of identifying a person based on their unique voiceprint present in their speech data. Pre-processing of the speech signal is performed before voice feature extraction. This process ensures the voice...


A Survey on Speech Recognition Algorithms

Speaker recognition is a process where a person is recognized on the basis of his/her voice signals. Human voice is a unique characteristic for any individual. Speaker recognition is being applied in biometric identification, security related areas, remote access to computers etc. This paper delivers an overview of different techniques that can be used in application of speaker recognition such as...


An investigation of cepstral parameterisations for large vocabulary speech recognition

We examined variants of MFCC and PLP cepstral parameterisations in the context of large vocabulary continuous speech recognition under different acoustical environmental conditions. Compared to MFCC, mel-frequency PLP uses a cubic root intensity-to-loudness law, and an LPC analysis is applied to the mel-warped spectrum. In LPC-smoothed MFCC, the only difference to MFCC is the additional LPC smooth...


LPC and MFCC Analysis of Assamese Vowel Phonemes

A speech signal contains many levels of information. Speech conveys information about the language being spoken and the emotion, gender, and identity of the speaker. Feature parameters extracted from speech are very useful for speaker recognition as well as speech recognition. In this paper, the LPC and MFCC features are computed for Assamese vowel phonemes, which will be helpful to develop...


GMM Classifier for Identification of Neurological Disordered Voices Using MFCC Features

Automatic detection of the voices of subjects with neurological disorders mostly relies on parameters extracted from time-domain processing. The calculation of these parameters often requires prior pitch period estimation, which in turn depends heavily on the robustness of the pitch detection algorithm. In the present work, a cepstral-domain processing technique which does not require pitch estimation has been ado...



Publication date: 2000